Name | Version | Summary | date |
parakeet-mlx |
0.3.5 |
An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX. |
2025-07-13 17:19:58 |
achatbot |
0.0.21.post0 |
An open source chat bot for voice (and multimodal) assistants |
2025-07-13 14:15:50 |
voice-mode-azure |
2.13.0 |
VoiceMode with Azure OpenAI support - Voice interaction capabilities for AI assistants |
2025-07-13 01:42:09 |
vocals |
1.0.984 |
A Python SDK for voice processing and real-time audio communication |
2025-07-11 04:52:57 |
text-to-speech-api |
2025.0.2 |
image-upscaling.net api client |
2025-07-10 23:23:25 |
modelscope |
1.28.0 |
ModelScope: bring the notion of Model-as-a-Service to life. |
2025-07-09 04:16:56 |
nemo-toolkit |
2.3.2 |
NeMo - a toolkit for Conversational AI |
2025-07-08 22:29:27 |
pyttsx3 |
2.99 |
Text to Speech (TTS) library for Python 3. Works without internet connection or delay. Supports multiple TTS engines, including Sapi5, nsss, and espeak. |
2025-07-08 12:24:21 |
senselab |
0.29.1 |
Senselab is a Python package that simplifies building pipelines for speech and voice analysis. |
2025-02-24 22:51:54 |
patkit |
0.14.1 |
Phonetic Analysis ToolKit: Tools for processing phonetic data |
2025-02-19 07:03:56 |
podonos |
0.11.0 |
Managed evaluation for audio & speech |
2025-02-19 00:59:14 |
agi-open-network-cn |
0.1.0 |
AGI Open Network China Models - A Simple and Powerful Framework for Chinese AI Models |
2025-02-02 11:28:57 |
phonexia-gender-identification-client |
1.3.1 |
Client script for communicationg with Phonexia gender identification microservice. |
2025-01-27 08:36:12 |
phonexia-enhanced-speech-to-text-built-on-whisper-client |
1.8.0 |
Client for communication with Phonexia Enhanced Speech To Text Built On Whisper microservice. |
2025-01-21 20:01:00 |
hume |
0.7.6 |
A Python SDK for Hume AI |
2025-01-15 21:09:16 |
pyobjc-framework-Speech |
11.0 |
Wrappers for the framework Speech on macOS |
2025-01-14 19:05:38 |
pnm |
0.0.1 |
Convert audio to phonetic text and practice improving your speech accent. |
2025-01-14 04:51:36 |
tetos |
0.4.2 |
Unified interface for multiple Text-to-Speech (TTS) providers |
2025-01-08 09:52:44 |
phonexia-grpc |
2.9.0 |
Library for communication with microservices developed by phonexia using grpc application interface. |
2025-01-08 09:45:12 |
inaSpeechSegmenter |
0.7.14 |
CNN-based audio segmentation toolkit. Does voice activity detection, speech detection, music detection, noise detection, speaker gender recognition. |
2025-01-06 17:45:13 |